research papers 



CrossMark 



lUCrJ 

ISSN 2052-2525 



BIOLOGY MEDICINE 



Structural insights into Wcbl, a novel 
polysaccharide-biosynthesis enzyme 



Mirella Vivoli, Emily Ayres, Edward Beaumont, Michail N. Isupov and 
Nicholas J. Harmer* 



Received 14 June 2013 
Accepted 30 September 201 3 



Edited by J. L. Smith, University of Michigan, 
USA 



Keywords: capsular polysaccharides; acetyl- 
CoA; acetyltransferases; Burkholderia pseudo- 
mallei; melioidosis 

PDB references: Wcbl, bound to coenzyme A, 
4bqn; native Wcbl, 4bqo 

Supporting information: this article has 
supporting information at www.iucrj.org 



College of Life and Environmental Sciences, University of Exeter, Geoffrey Pope Building, Stacker Road, Exeter EX4 4QD, 
England. *Correspondence e-mail: n.j.harmer@exeter.ac.uk 

Capsular polysaccharides (CPSs) are protective structures on the surfaces of 
many Gram-negative bacteria. The principal CPS of the human pathogen and 
Tier 1 select agent Burkholderia pseudomallei consists of a linear repeat of 
-3)-2-0-acetyl-6-deoxy-/S-D-m<3««o-heptopyranose-(l-. This CPS is critical to 
the virulence of this emerging pathogen and represents a key target for the 
development of novel therapeutics, wcbl is one of several genes in the CPS 
biosynthetic cluster whose deletion leads to significant attenuation of the 
pathogen; unlike most others, it has no homologues of known function and no 
detectable sequence similarity to any protein with an extant structure. Here, 
the crystal structure of Wcbl bound to its proposed product, coenzyme A, is 
reported at 1.38 A resolution, solved using the halide-soak method with multiple 
anomalous dispersion. This structure reveals that Wcbl incorporates a 
previously described 100-amino-acid subdomain into a novel, principally helical 
fold (310 amino acids). This fold adopts a cradle-like structure, with a deep 
binding pocket for CoA in the loop-rich cradle. Structural analysis and 
biophysical assays suggest that Wcbl functions as an acetyltransferase enzyme, 
whilst biochemical tests suggest that another functional module might be 
required to assist its activity in forming the mature B. pseudomallei capsule. 




1 . Introduction 



13 




OPEN Q ACCESS 



The human pathogen Burkholderia pseudomallei is a Gram- 
negative soil-dwelling bacterium that causes a severe febrile 
disease, melioidosis. The organism is endemic throughout the 
tropics (Wiersinga et at, 2012) and is the most common form 
of community-acquired infection in parts of Thailand and 
Northern Australia. It has been reported on every continent 
except Antarctica (Stone, 2007). B. pseudomallei infects 
humans, a range of animals and some plants. Disease states in 
humans include acute and chronic forms, ranging from acute 
pneumonia or septicaemia to chronic abscesses (Lazar Adler 
et at, 2009); without the use of appropriate antibiotics, the 
septicaemic form of melioidosis has a mortality rate that 
exceeds 90% (Warner et «/., 2007). Many countries in which 
B. pseudomallei is endemic are located in less developed parts 
of the world, where bacteria cause disease in large numbers of 
people. It is believed that between 20 and 50% of all patients 
diagnosed with severe forms of melioidosis will die, depending 
on their access to appropriate medicines and healthcare, which 
is often not sufficient in less developed countries. Even with 
satisfactory healthcare, relapse of the disease is a common 
problem, with many patients requiring treatment for the rest 
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of their lives (Lazar Adler et al, 2009). Because of its high 
mortality rate, B. pseudomallei is considered as a potential 
bioterror threat and is currently classed by the US Centers for 
Disease Control (CDC) as a Tier 1 select agent (Department 
of Health and Human Services, 2012). B. pseudomallei is 
naturally resistant to many antibiotics, and has no currently 
available vaccine. An effective vaccine to both protect popu- 
lations exposed to infection and to alleviate the risk of 
bioterrorism is thus urgent. 

The principal capsular polysaccharide (CPS-I) of B. pseudo- 
mallei is one of the best verified virulence factors of this 
organism (Lazar Adler et al, 2009; Wiersinga et al, 2006) and 
is considered to be a strong candidate for vaccine develop- 
ment. Ablation of CPS-I abolishes B. pseudomallei virulence 
in animal models (Reckseidler et al, 2001; Atkins, Prior, Mack, 
Russell, Nelson, Prior et al, 2002), and CPS provides partial 
protection when used to vaccinate mice against melioidosis 
(Nelson et al, 2004). Most strains of the non-pathogenic close 
relative organism B. thailandensis do not have the genes 
required for CPS biosynthesis (Reckseidler et al, 2001), and 
live vaccines consisting of mutants lacking a polysaccharide 
capsule have been shown to provide only partial protection 
against melioidosis (Atkins, Prior, Mack, Russell, Nelson, 
Oyston et al, 2002; Sarkar- Tyson et al, 2009; Stevens et al, 
2004; Breitbach et al, 2008). 
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CPS-I of B. pseudomallei and the related Tier 1 select agent 
B. mallei consists of a linear polymer of -3)-2-0-acetyl- 
6-deoxy-j6-D-»?fl««o-heptopyranose-(l- (2Ac-dDHep; Reck- 
seidler et al, 2001; Heiss et al, 2012). This polysaccharide is 
biosynthesized by a cluster of 24 genes found on B. pseudo- 
mallei chromosome 1 (Reckseidler et al, 2001; DeShazer et al, 
2001; Holden et al, 2004). The function of the majority of 
these has been proposed from bioinformatics and functional 
studies (Cuccui et al, 2012). However, three genes (wcbF, 
wcbG and wcbl) do not have proposed functions, and the 
protein responsible for 2-O-acetylation of the capsule sugars 
has not been identified (Fig. 1). wcbl has been shown to be 
required for virulence (DeShazer et al, 2001; Cuccui et al, 
2007). In addition, the location of the gene in this cluster, and 
the diminished capsule manufacture in its absence, suggest 
that this gene is necessary for efficient CPS-I production. 
BLAST searches of the sequence do not reveal any homo- 
logues with predicted functions, and a series of fold- 
recognition servers give only equivocal matches to known 
structures. 

To determine the biochemical function of Wcbl, we have 
therefore solved its crystal structure to 1.38 A resolution. The 
structure revealed that Wcbl adopts a predominantly helical 
fold: the N-terminal 100 amino acids form a ligand-binding 
domain fold, with the remaining 210 amino acids adopting an 

entirely novel fold. The protein 
binds tightly to coenzyme A and 
its derivative acetyl-CoA, with an 
apparent k d of 58 \iM by differ- 
ential scanning fiuorimetry. Our 
extensive biochemical assays 
failed to show acetyltransferase 
activity for Wcbl when incubated 
with any available substrate. 
Based on the structure and on 
biochemical and biophysical data, 
we propose that Wcbl functions 
as an acetyltransferase and 
requires another functional 
module to carry out this function. 



2. Materials and methods 



OH 




Figure 1 

Proposed pathway for the biosynthesis of 2-0-acetyl-6-deoxy-/3-D-manno-heptopyranose from sedoheptu- 
lose 7-phosphate in B. pseudomallei (after Harmer, 2010). 



2.1. Protein cloning, expression 
and purification 

Full-length wcbl from B. 
pseudomallei strain K96423 (the 
genomic DNA was a gift from 
R. Titball, University of Exeter) 
was cloned into pNIC28-Bsa4 
(Savitsky et al, 2010; a gift 
from O. Gileadi, SGC Oxford). 
Constructs were transformed into 
Escherichia coli Rosetta (DE3) 
cells (Merck) and grown in ZYM- 
5052 medium (Studier, 2005) 



lUCrJ (2014). 1, 28-38 



Mirella Vivoli et a/. ■ Structural insie hts into Wcbl 29 



research papers 



Table 1 

Data-collection and refinement statistics. 

Values in parentheses are for the outer resolution shell. 
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supplemented with 100 ug ml kanamycin and 20 pg ml 
chloramphenicol. Cells were grown at 37°C until the OD 600 
reached 0.5 and then at 20°C for 16 h. The harvested cells were 
resuspended in 20 mM Tris-HCl pH 8.0, 0.5 M NaCl (buffer 
A) and lysed using a Soniprep 150 sonicator (MSE). The 
clarified lysate was purified using a nickel-agarose column 
(Bioline) or a HisTrap Crude FF column (GE Healthcare). 
Briefly, the loaded protein was washed with buffer A supple- 
mented with 25 mM imidazole-HCl pH 8.0 and eluted with 
buffer A supplemented with 250 mM imidazole-HCl pH 8.0. 
Wcbl was then loaded onto a Superdex 200 16/60 HR column 
(GE Healthcare) and eluted isocratically with 10 mM HEPES 
pH 7.0, 0.5 M NaCl. For preparation of tag-free Wcbl, purified 
protein was incubated at 4°C for 48 h with 1:100 TEV protease 
(S219V mutant; Kapust et al, 2001; purified using methods 
available at http://mcll.ncifcrf.gov/waugh_tech/protocols/ 
pur_histev.pdf). The sample was then extensively loaded over 
nickel-agarose to remove residual undigested Wcbl and TEV. 

2.2. Crystallization and structure determination 

All crystals were grown using the microbatch method and 
were prepared using an Oryx 6 crystallization robot (Douglas 
Instruments). Crystals grew in 2.5 mg ml" 1 Wcbl, 10%(uVv) 
PEG 8000, 75 mM LiCl, 75 mM MgCl 2 , 0.05 M HEPES pH 
7.0-8.0 at 4°C. Prior to flash-cooling, crystals were soaked in 
cryoprotectant solutions consisting of 22%(v/v) PEG 400, 



8%(v/v) PEG 8000, 1 M NaBr, 0.05 M HEPES pH 7.0, 7.5 and 
8.0 for 5 min. Co-crystallization with acetyl-CoA was achieved 
by growing crystals in the same conditions, with Wcbl at 
2 mg ml - , supplemented with 200 uM acetyl-CoA. These 
crystals were cryopreserved in the above solution without 
sodium bromide. The Wcbl structure was solved using the 
multiple anomalous dispersion (MAD) method following the 
short (<5 min) cryosoaking of a crystal with bromide ions 
(Dauter et al, 2000). Diffraction data were collected from a 
bromide-soaked crystal on beamline 102 at the Diamond Light 
Source synchrotron at four wavelengths (Table 1) using 
an ADSC detector. Data were processed with XDS (Kabsch, 
2010) and scaled together using SCALA (Evans, 2011) and the 
xia2 (Winter, 2010) pipeline. Most of the further data and 
model manipulations were carried out using programs from 
the CCP 4 suite (Winn et al, 2011). Anomalous scatterers were 
located and a polyalanine model of the protein was built using 
the SHELXCIDIE pipeline (Sheldrick, 2010) implemented in 
HKL2MAP (Pape & Schneider, 2004). A full atomic model 
was built using the ARPIwARPIREFMAC5 pipeline (Langer 
et al, 2008; Murshudov et al, 2011). High-resolution data were 
collected from the acetyl-CoA-soaked crystal on beamline 124 
at Diamond Light Source using a PILATUS detector. The 
final refinement of both models was conducted with PHENIX 
(Adams et al, 2010). The model was manually rebuilt in Coot 
(Emsley et al, 2010) and the final model was validated using 
PHENIX and MolProbity (Chen et al, 2010). The statistics of 
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the refinement and the stereochemistry of the final models are 
summarized in Table 1. The coordinates and structure factors 
were deposited in the Protein Data Bank under accession 
codes 4bqn for Wcbl covalently bound to coenzyme A and 
4bqo for the native structure. Figs. 2(a), 2(d), 2(f), 3(a), 3(c) 
and Supporting Figs. S2(a), S2(c) and S3(a) were prepared 
with the PyMOL Molecular Graphics System (Schrodinger), 
and Fig. 3(b) and Supporting Figs. S2(fo) and S3(£>) were 
prepared with LigPlot (Laskowski & Swindells, 2011). The 
stereoview of the Wcbl C" backbone (Fig. 2b) was prepared 
using the program BobScript (Kraulis, 1991; Esnouf, 1997) and 
Fig. 2(c) was prepared with TOPS (Michalopoulos et al, 2003). 

2.3. Differential scanning fluorimetry (DSF) 

DSF samples were prepared at the following concentrations 
in a final volume of 20 ul: 0.1 mg ml - protein, 10 mM HEPES 
pH 7.0, 150 mM NaCl, 8x SYPRO Orange dye (Invitrogen; 
Niesen et al, 2007). Potential ligands were added to final 
concentrations of between 0.1 \iM and 1 mM. All samples 
were prepared in triplicate. Fluorescence was measured using 
a StepOne quantitative PCR machine (Applied Biosystems) 
while heating the samples in a gradient from 25 to 99° C over 
40 min. Measurements were taken every 0.37° C. The DSF 
curves showed a mixture of two populations of protein. Rather 
than calculating the midpoint temperature of the unfolding 
transition (T m ), the curves were fitted to determine the 
proportions of protein in the bound and unbound states. A full 
description of this approach will be published shortly (Vivoli 
& Harmer, in preparation). Briefly, the differential of each 
DSF curve was calculated using the Protein Thermal Shift 
software package (Applied Biosystems), and each curve was 
compared with the unbound sample and a sample at the 
highest ligand concentration to determine the proportion in a 
bound state using SPSS v.20. The dissociation constant k d was 
calculated using 

proportion bound = 

[CoA] + E 0 + k d - {([CoA] + E 0 + k A f - (4E 0 [CoA])} 1/2 

2E 0 

(1) 

E 0 was treated as a constant, and the value of k d was deter- 
mined using SPSS v.20. Following the calculation of k d , a 
standard curve was modelled using this equation. The likely 
proportion bound in the highest ligand concentration sample 
was calculated and a differential curve at 100% ligand 
concentration was modelled. The determination process was 
then repeated until the model converged. 29 concentrations of 
CoA were used in at least triplicate, and the results reported 
are representative of results obtained on at least three days. 

2.4. Acetyltransferase kinetic assays 

2.4.1. Assay of coenzyme A production by coupling to 
5'-dithio-bis(2-nitrobenzoic acid). The activity assay was 
based on the hydrolysis of acetyl-CoA in the presence of the 
substrates guanosine diphosphate mannose (GDP-mannose) 



or mannotetraose. 5'-Dithio-bis(2-nitrobenzoic acid) (DTNB) 
was used as a colorimetric developing agent for the detection 
of CoA (Shaw, 1977). Acetyl-CoA (0-1 mM), GDP-mannose 
(4 mM) or mannotetraose (2 mM) and 0.5 \iM purified 
recombinant Wcbl were mixed and pre-incubated at room 
temperature for 5 min in a 96-well plate in a final volume of 
50 ul containing 100 mM Tris-HCl buffer pH 8.0. The reaction 
was stopped with 25 ul guanidine hydrochloride solution 
(6.4 M guanidine-HCl, 0.1 M Tris-HCl pH 8.0) and incubated 
for 3 min, gently shaking. Finally, 25 ul 10 mM DTNB was 
added to the stopped reaction and the absorbance at 412 nm 
(peak of absorbance of 2-nitro-5-thiobenzoic acid; TNB) was 
measured. Rates were calculated using an extinction coeffi- 
cient for TNB at 412 nm of 13 600 M _1 crrT 1 . Reactions in 
which GDP-mannose, mannotetraose, acetyl-CoA or recom- 
binant Wcbl were omitted served as controls. The same assay 
was performed using 20 U chloramphenicol acetyltransferase 
(Sigma) and 2 mM chloramphenicol for a positive control. 

2.4.2. Assay of coenzyme A production using fluorescence. 
The assay for the determination of acetyltransferase activity 
was carried out using the acetyltransferase activity kit (Enzo 
Life Sciences) as per the manufacturer's instructions. Briefly, 
in a 96-well plate, 0-3 mM donor substrate (GDP-mannose) 
was diluted in lx reaction mixture. An enzyme mixture 
consisting of 25 ul 1 x transferase assay buffer and 25 ul 
0.5 uM Wcbl was added to this. The plate was covered with a 
foil sealer and incubated for 5 min with constant shaking in an 
orbital shaker at room temperature. 50 ul positive control 
(supplied in the kit) was added to the appropriate wells. 50 ul 
ice-cold isopropyl alcohol was then added to each well. Finally, 
100 ul of the lx detection solution (supplied in the kit) was 
added to each well. The plate was covered with a foil sealer 
and further incubated for 10 min at room temperature without 
shaking. The plate was then read at 380 nm excitation and 
520 nm emission wavelengths. The test was performed in 
duplicate. The signal-to-noise ratio was calculated from the 
relative fluorescence unit of a blank with buffer alone. 

2.4.3. Assay of coenzyme A production using a coupled 
reaction with pyruvate dehydrogenase. Acetyltransferase 
activity was monitored continuously using a Tecan Infinite 200 
microplate reader as described previously (Kim et al, 2000). 
Briefly, the coenzyme A generated in a Wcbl reaction was 
continuously measured using a coupled system with pyruvate 
dehydrogenase; the CoA-dependent oxidation of pyruvate is 
accompained by the reduction of NAD + to NADH, which was 
measured spectrophotometrically at 340 nm. Reactions were 
performed in 50 ul and consisted of 10 mM HEPES pH 7, 
0.2 mM NAD + , 0.2 mM thiamine pyrophosphate, 5 mM 
MgCl 2 , 1 mM dithiothreitol, 2.5 mM pyruvate and 0.03 U 
pyruvate dehydrogenase (Sigma-Aldrich), 1 mM GDP- 
mannose or 1 mM mannotetraose, 0.5 uM recombinant Wcbl 
and 0-500 \iM acetyl-CoA. Assays were performed in dupli- 
cate at 37° C, mixing all reagents without Wcbl for 3 min to 
exhaust endogenous CoA; after this, Wcbl was added and the 
reaction was followed for a further 5 min. The same assay was 
performed using 20 U chloramphenicol acetyltransferase 
(Sigma) and 1 mM chloramphenicol as a positive control. 
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3. Results 

3.1. Phasing 

The crystal structure of Wcbl was determined by the 
multiple-wavelength anomalous dispersion method, using data 
collected at four wavelengths. Peak, inflection and high-energy 
remote data were collected to a limited resolution with short 
exposures and low transmission in order to minimize possible 
radiation damage and to collect multiple equivalents. As the 
Wcbl crystals belonged to space group PI, a high-energy 
remote data set was collected over a rotation interval of 360° 
to increase the data redundancy. No radiation damage to the 
crystal was apparent after the first three data sets, so the 
exposure times were increased for the low-energy remote data 
set, which was collected over an interval of 360° to a high 
resolution of 1.56 A. As the crystal proved to be stable to 
X-rays and diffracted well, the reflections from corners of the 
detector were used in anomalous phasing, as the experiment 
geometry was the same for the different data sets. In spite of 
the low reported completeness of the peak, inflection and 
high-energy remote sets (Table 1), these data were over 90% 
complete to 2.7 A resolution. The coordinates of 24 Br ions 
were used for phasing. These were located using 1000 attempts 
by SHELXD at 2.4 A resolution with a correlation of 27.3% 
(20.5% at high resolution). The high resolution of the low- 
energy remote data set was crucial for subsequent density 
modification and successful building of the Wcbl polyalanine 
model (605 residues out of 622) by SHELXE. For the correct 
hand, the correlation for the partial structure against the 
native data was 36.5 X. 1 The full Wcbl atomic model was built 
by ARPIwARP using the polyalanine model produced by 
SHELXE and the low-energy remote data. This was found to 
contain bound CoA, carried through purification from the 
expression host, and will henceforth be termed the Wcbl 
native structure. A second crystal was obtained co-crystallized 
with substrate acetyl-CoA, and high-resolution complex data 
were collected. The statistics of data collection and of model 
refinement and validation are presented in Table 1. 

3.2. Quality of the models 

The structure of the CoA complex of Wcbl and the native 
structure were refined by PHENIX and CCPA, respectively, to 
resolutions of 1.38 and 1.56 A, respectively. For each model 
the final round of refinement resulted in acceptable values of 
the R factor and i? free - The Wcbl crystals contained two 
independent molecules in the unit cell. Some residues were 
not modelled owing to poor electron density at the N-termini. 
The G-factors calculated for both models confirmed that the 
structures have normal stereochemical properties as given by 
PROCHECK (Laskowski et al., 1993). The Ramachandran 
plots of the models (Chen et al., 2010) revealed that at least 
97% of the residues lie in the most favoured regions. The 
native structure contains 38 bromide ions. These were located 
using anomalous difference synthesis with peak Br data and 
phases from the refined model. Cofactor molecules were 

1 Correlations of over 30% at this stage generally indicate a correct solution. 



positioned in the active site using F Q — F c OMIT maps and the 
occupancies of these molecules and the bromides were refined 
by PHENIX. In both subunits of both structures Ile244 is the 
sole Ramachandran plot outlier. This residue is well defined in 
the electron density and the B factors of its atoms are low. 
Many residues were modelled with alternative conformations 
of their side chains. For several regions in both structures the 
main chain was modelled in an alternative conformation. 
There are no Wcbl residues with a cis conformation. 

3.3. Overall structure 

The triclinic unit cell of the Wcbl crystal contains two 
molecules (residues 2-310 for chain A and residues 4-310 for 
chain B). Comparison of the structure with extant structures 
using the protein structure comparison services DALI 
(http://ekhidna.biocenter.helsinki.fi/dali_server/; Holm & 
Rosenstrom, 2010) and PDBeFold (http://www.ebi.ac.uk/ 
msd-srv/ssm; Krissinel & Henrick, 2004) revealed that Wcbl 
largely adopts a novel fold. The fold consists of two sub- 
domains (Figs. 2a, 2b and 2c). The N-terminal lobe (amino 
acids 2-102) forms a four-stranded parallel /8-sheet with 
connectivity — lx, +2x, +lx (Richardson, 1981) flanked on one 
side by two a-helices (al-a2) and on the other by three a- 
helices (a3-a5). This arrangement has previously been 
observed in a number of proteins with different functions, for 
example Neisseria meningitidis glycerate kinase (New York 
SGX Center for Structural Genomics, unpublished work; PDB 
entry lto6) and the zinc-transport protein ZnuA (Banerjee et 
al., 2003; PDB entry lpq4). Most of these proteins have a 
ligand/metal-binding site in the crevice formed at the C- 
termini of /81 and /33 of the /8-sheet. Cysl4 of Wcbl is located 
in this position. The C-terminal subdomain (amino acids 103- 
311) is a novel arrangement of several a-helices that pack 
against the N-terminal domain (Fig. 2a). Analytical size- 
exclusion chromatography (Supporting Fig. SI) indicates that 
Wcbl is likely to be monomeric in solution. The two molecules 
are related by an approximate twofold axis with a rotation 
angle of 175°; however, the buried surface area at the interface 
is only 140 A 2 , which is less than 4% of the total accessible 
monomer surface area. Analysis of the structure with PISA 
did not indicate any likely multimerization interfaces 
preserved in the crystal. 

3.4. Binding of acetyl-CoA and CoA to Wcbl 

Analysis of early crystals of Wcbl revealed that an 
unidentified ligand had been partially retained with the 
protein from expression in E. coli through the purification 
process (Supporting Fig. S2«). Mass spectrometry clearly 
identified CoA as the ligand co-purified with Wcbl (data not 
shown). We therefore co-crystallized Wcbl with acetyl-CoA. 
CoA was clearly defined in the electron-density map (Fig. 3a) 
in both protomers, whilst acetyl-CoA was only observed in 
protomer A. However, following occupancy refinement, the 
occupancy of the acetyl moiety was too low for meaningful 
modelling. The molecule is located in a funnel-shaped cleft, 
surrounded by loops. The bound cofactor adopts a U-like 
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The structure of Wcbl reveals a novel fold, (a) Cartoon representation of the crystal structure of Wcbl in two orientations. Termini and secondary- 
structure elements are labelled. For the N-terminal subdomain a-helices and loops are shown in red and /!-sheets in yellow; for the C-terminal subdomain 
a-helices and loops are shown in blue and /i-sheets in green, (b) Stereoview of the C™ backbone in the same orientation as in the left-hand image of (a). 
Every tenth residue is numbered, (c) Topological diagram of the polypeptide fold with a-helices indicated by circles and /3-strands by triangles coloured 
as in (a), (d) Cartoon representation of Wcbl in the orientations in (a) coloured as a rainbow with the N-terminus in blue and the C-terminus in red. (e) 
Schematic diagram of the fold of Wcbl. a-Helices and 3 10 -helices are shown as cylinders and /i-sheets are shown as arrows. Secondary -structure elements 
are coloured as in (a 1 ). The start and end residue of each element is indicated. (/) Cartoon representation of the novel fold C-terminal subdomain 
coloured as a rainbow with the N-terminus in blue and the C-terminus in red. The substrate coenzyme A is shown as spheres. Carbon, cyan; oxygen, red; 
nitrogen, blue; sulfur, yellow; phosphorus, orange. Interacting residues from the novel subdomain are shown as lines. The secondary-structure 
assignments in all panels were made according to TOPS (Michalopoulos et al, 2003). (a), (f>), (d) and (/) were prepared with the PyMOL Molecular 
Graphics System (Schrodinger) and (c) was prepared with TOPS. 
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overall conformation that is stabilized by several hydrogen 
bonds to Wcbl, electrostatic interactions with charged resi- 
dues, and hydrophobic contacts (Fig. 36, Supporting Fig. S2b). 
In both protomers, the /J-mercaptoethylamine group of CoA 
forms a disulfide bond to the residue Cysl4 (Figs. 36 and 3c), 
whilst the acetyl-CoA S atom is positioned near, but slightly 
away, from this and its carbonyl group is hydrogen-bonded to 
the S atom of Cysl4 and His201 (data not shown). Interest- 
ingly, in the native structure the CoA terminal SH group 
points away from Cysl4 toward Cys41 (Supporting Fig. S2a), 
perhaps owing to a more reducing environment of the crys- 




3ksn200 



tallization conditions. Water molecules and Tyr97 form 
hydrogen bonds to the 4-phosphopantothenic acid portion of 
the acetyl-CoA (Fig. 3c, Supporting Fig. S2c). The side chain 
of Arg294 is in electrostatic contact with the 3'-phosphate 
group; the amide N atoms of Ser69, Argl67 and Asnl72 form 
hydrogen bonds to the 5'-phosphate group. Finally, water 
molecules form hydrogen bonds to the ADP moiety, which is 
also involved in a cation-jr interaction with the guanidinium 
group of Argl67 (Fig. 3c, Supporting Fig. S2c). The extensive 
interactions of CoA with both the N-terminal and C-terminal 
domains is in contrast to the GNAT acetyltransferases that 
superficially resemble the N-terminal domain of Wcbl, in 
which CoA is intimately bound to a single domain (Supporting 
Fig. S3). 



3.5. Ligand- and cofactor-binding study by differential 
scanning fluorimetry 

To verify that CoA is a genuine ligand for Wcbl, we 
investigated its binding using differential scanning fluorimetry 
(DSF; Niesen et at, 2007). Firstly, the protein was extensively 
dialysed in the presence of reducing agents to remove any 
disulfide-bonded CoA from the protein. The apoprotein was 
then incubated with a range of CoA concentrations and the 
thermal stability of the protein was probed. The presence of 
CoA showed a marked stabilizing effect, raising the apparent 
melting temperature by 10°C (Fig. 4a), whilst a range of other 
common chemicals found in bacteria (after Niesen et at, 2007) 
failed to give any significant change (data not shown). Fitting 
of the data (Fig. 4b) indicated a dissociation constant k d of 58 
± 2 [iM, which is well within the normal range of bacterial 
acetyl-CoA concentrations (Takamura & Nomura, 1988). 



(b) 




Figure 3 

Wcbl binds to coenzyme A. (a) Electron-density map showing the binding of coenzyme A (CoA) to Wcbl. Wcbl is shown as a cyan cartoon, with CoA as 
sticks. For clarity, water molecules are not shown. An F 0 — F c OMIT map is shown contoured at 3a. The bond from CoA to Cysl4 is shown, (b) Schematic 
drawing of the Wcbl-CoA interactions, (c) Key Wcbl-CoA interactions. Wcbl is shown as a cyan cartoon, with interacting side chains as purple sticks. 
CoA is shown as sticks. Hydrogen-bond distances are shown. Nitrogen, blue; oxygen, red, phosphorus, orange; CoA carbon, yellow; Wcbl carbon, purple. 
(a) and (c) were prepared with the PyMOL Molecular Graphics System (Schrodinger) and (£>) was prepared with LigPlof (Laskowski & Swindells, 
2011). 
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Table 2 

Thermodynamic analysis of the binding of sugars, sugar nucleotides and 
CoA to Wcbl by DSF experiments. 



Ligands (1 mM) 


T m ± SDt (°C) 


No ligand 


37.4 ± 0.4 


CoA 


46.2 ± 0.2 


Acetyl-CoA 


45.6 ± 0.5 


D-Mannose 


37.0 ± 0.1 


D-Maltose 


37.1 ± 0.1 


D-Lactose 


37.5 ± 0.3 


D-Arabinose 


37.0 ± 0.1 


D-Lyxose 


37.1 ± 0.1 


Cellubiose 


37.7 ± 0.2 


Polymannose 


36.2 ± 0.1 


GDP 


37.0 ± 0.1 


UDP 


36.9 ± 0.05 


GDP-mannose 


38.2 ± 1.0 



f The melting temperature T m is expressed as the mean ± SD and was determined in 
triplicate. The concentration of Wcbl was 0.11 mg ml . 

As wcbl is essential for the virulence of B. pseudomallei 
(Cuccui et al, 2007), and the activity to acetylate the CPS has 
not been identified (Cuccui et al, 2012), we hypothesized that 
Wcbl might be this acetylase. To this end, using DSF, we 
investigated whether Wcbl might bind to any potential 
substrates. We tried a range of simple sugars (D-mannose, 
D-maltose, D-lactose, D-arabinose and D-lyxose), sugar poly- 
mers (cellubiose and polymannose) and nucleotides and sugar 
nucleotides (GDP, UDP and GDP-mannose). In each case 
there was no apparent alteration in the melting temperature 
(Table 2). It was apparent that it would not be possible to 
identify a potential substrate in this manner. 

3.6. Acetyltransferase kinetic assays 

To determine whether enzymatic assays could assist in 
determining a potential substrate for Wcbl, we used several 
different assays to assess the capacity of the enzyme to 
acetylate GDP-mannose or mannotetraose. These were 
selected as excellent analogues for the most likely substrates 
for Wcbl: GDP-mannose is highly similar to GDP-6-deoxy-/S- 
D-maMMO-heptopyranose, differing by one -CH 2 - group; whilst 
mannotetraose is a reasonable analogue of the polymerized 
sugar (having similar sugar units but a slightly different 
linkage). We attempted three chemically different assays for 
acetyltransferase activity. A DTNB-based assay (Riddles et al, 
1983), which measures the level of free thiols (and hence spent 
CoA), was performed. No significant acetylase activity was 
detected using GDP-mannose or mannotetraose as acceptor 
substrates. Whilst small changes in the product absorbance 
were observed, these were comparable to those seen in a 
bovine serum albumin negative control. Indeed, the back- 
ground reaction rate increased with increasing acetyl-CoA or 
recombinant Wcbl concentrations. Wcbl carries over some 
CoA from purification, apparently forming a covalent thiol 
adduct with Cysl4, as the crystal structures attest (Fig. 3a). 
Similar adducts resulted in a catalytically inactive form in 
the case of glucosamine-6-phosphate /^-acetyltransferase 1 
(Dorfmueller et al., 2012). This enzyme can be reactivated 
under reducing conditions with increasing amounts of redu- 



cing agents. To ascertain whether addition of reducing agents 
could rescue Wcbl activity, we repeated this assay in the 
presence of DTT Although there was a greater increase in the 
absorbance at 412 nm, this was also observed in controls and is 
likely to arise from the seven free cysteines of Wcbl and the 
free CoA contaminant reacting with the DNTB molecule, 
suggesting that this assay is not suitable for this purpose. 
However, as expected, in the positive control performed using 
chloramphenicol acetyltransferase (CAT) the assay clearly 
shows a significant amount of CAT enzyme activity compared 
with Wcbl (Fig. 5a). 

We therefore performed a second assay, specifically 
measuring free CoA using a fluorescent detection kit. Using a 
concentration range of GDP-mannose from 0 to 3 mM and 
constant Wcbl and acetyl-CoA, there was no evidence for the 
transfer of the acetyl group to the substrate GDP-mannose 
compared with a control reaction (Fig. 5b). A third, coupled 
assay was therefore performed. In this assay, spent CoA is 
turned over by pyruvate dehydrogenase, with concomitant 
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Figure 4 

Biophysical characterization of Wcbl binding to CoA. (a) Differential 
scanning fluorimetry thermal denaturation curves for Wcbl show a 
biphasic response. With no CoA (black line) or a high saturation (1 mM) 
of CoA (grey line), a monophasic response is observed. At intermediate 
concentrations of CoA (60 uM; dashed black line), a biphasic curve 
representing some bound and some unbound Wcbl is observed, (b) Wcbl 
shows dose-dependent binding to CoA, as expected for a single tight- 
binding molecule. All experiments were performed in triplicate and the 
data are representative of at least three experiments. The black line 
represents the fit of the data according to (1). 
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Figure 5 

Wcbl shows no detectable acetyltransferase activity, (a) Acetyltrans- 
ferase activity was measured using the DTNB assay. Each column 
represents an average of three measurements. The concentrations of 
acetyl-CoA and GDP-mannose were held constant at 1 and 4 mM, 
respectively. The same assay performed with chloramphenicol acetyl- 
transferase and 2 mM chloramphenicol served as a positive control. 
Reactions in which recombinant Wcbl and CAT were omitted served as a 
negative control, (b) Acetyltransferase activity was measured using a 
fluorescence-based activity assay. Concentrations of GDP-mannose 
ranging from 0 to 3 mM were used, with the Wcbl concentration held 
constant. The positive control supplied in the kit was used according to the 
manufacturer's instructions, (c) Acetyltransferase activity was measured 
using a coupled assay with pyruvate dehydrogenase. Substrate concen- 
trations were 0.5 mM for acetyl-CoA and 1 mM for GDP-mannose. The 
same assay was performed using CAT and 1 mM chloramphenicol as a 
positive control. A negative control was performed without either 
enzyme. The rates were calculated using the extinction coefficient for 
NADH. All columns represent the average ± standard deviation of two 
measurements. 



reduction of NAD + to NADH. This reaction can be followed 
spectrophotometrically at 340 nm in real time. The assay 
performed well in the presence of a test acetyltransferase 
(chloramphenicol acetyltransferase). However, using Wcbl, 
no activity was observed using acetyl-CoA and either GDP- 
mannose or mannotetraose as substrates (Fig. 5c). Further- 
more, computational approaches based on analysis of protein 
sequence and structure, such as the 3DLigandSite (Wass et ai, 
2010), LIGSITE esc (Huang & Schroeder, 2006), ConCavity 
(Capra et ai, 2009) or FINDSITE (Brylinski & Skolnick, 2008) 
servers, have been employed to predict potential further 
protein functional sites, including ligand-binding sites. All of 
these prediction servers confirmed the presence of a binding 
pocket for CoA, but did not identify any Wcbl surface region 
suitable for interactions with sugars or sugar-nucleotides. 

4. Discussion 

The B. pseudomallei capsular polysaccharide consists of a 
linear repeat of -3)-2-0-acetyl-6-deoxy-/S-D-ma««o-hepto- 
pyranose-(l- The genes required for the biosynthesis of this 
polysaccharide have been mapped in B. pseudomallei (Reck- 
seidler et ai, 2001) and B. mallei (DeShazer et at, 2001). Six 
genes {web J, wcbK, wcbL, gmhA, wcbM and wcbN) form a 
unit that is proposed to synthesize the precursor sugar GDP-6- 
deoxy-/J-D-ma««o-heptopyranose (dDHep) from sedoheptu- 
lose 7-phosphate, a pentose phosphate pathway intermediate 
(Cuccui et ai, 2012; Fig. 1). Nevertheless, the protein(s) that 
acetylates either GDP-6-deoxy-heptose or the partially poly- 
merized capsule has not been identified. To date, Wcbl has no 
predicted function (DeShazer et ai, 2001; Reckseidler et ai, 
2001), but has been shown to be required for virulence 
(Cuccui et ai, 2007) and for the production of the capsule 
immunogen (Cuccui et ai, 2012). 

The crystal structure of Wcbl demonstrated that this 
protein largely adopts a novel fold (Fig. 2). The fold of the 
N-terminal domain (amino acids 2-102) is common to several 
extant protein structures involved in signalling, ligand binding 
and nucleotide chemistry. The remaining 208 amino acids form 
an extended, largely a-helical structure, although a short 
portion also packs onto the N-terminal domain. It is striking 
that this fold generates a helical backbone that presents a 
loop-rich region at the interface between this domain and the 
N-terminal domain. The outcome is a large cleft between the 
two domains, with the promise of molecular flexibility to aid 
the ingress and egress of substrates, and perhaps to allow some 
remodelling of the binding site. 

Surprisingly, our crystal structure of Wcbl showed that the 
protein carries a ligand molecule through purification from 
E. coli (Supporting Fig. S2). Differential scanning fluorimetry 
demonstrates that this molecule is likely to be coenzyme A, as 
the presence of this molecule increases the apparent melting 
temperature of Wcbl by approximately 10°C (Table 2, Fig. 4a). 
The apparent k d of 58 \iM is well within the physiological 
range for most bacteria, implying that this is a reasonable 
candidate for the substrate of the protein. Co-crystallization 
with 200 \iM acetyl-CoA led to almost complete occupancy of 
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the ligand, with all CoA ligand atoms visible in the density. 
Although there is low occupancy of the acetyl group in the 
structure, the functional groups of acetyl-CoA form an 
extensive set of interactions with the protein, supporting the 
premise that this is a bona fide ligand of Wcbl. It is therefore 
not unreasonable to hypothesize that Wcbl could provide 
the missing acetyltransferase activity. A structural similarity 
search did not yield matches with other known acetyl- 
transferases; the highest scoring PDBeFold hit (Q = 0.15; PDB 
entry 4h08, a putative hydrolase; Joint Center for Structural 
Genomics, unpublished work) and DALI hit (Z = 9.8; PDB 
entry 3hg7, a putative dehydrogenase; New York SGX 
Research Center for Structural Genomics, unpublished work) 
have three helices in a similar conformation but otherwise 
have no similar structure. Comparison with previously 
described GNAT acetyltransferases demonstrated that Wcbl 
folds and binds CoA in a very different manner (Supporting 
Fig. S3). 

To pursue this hypothesis, we performed kinetic and 
biophysical assays. In contrast to the results observed with 
CoA (Fig. 4a), no thermal shifts were observed in the presence 
of potential substrates such as simple sugars (D-mannose, 
D-maltose, D-lactose, D-arabinose and D-lyxose), sugar poly- 
mers (cellubiose and polymannose) or nucleotides or sugar 
nucleotides (GDP, UDP and GDP-mannose; Table 2). This 
was the case either in the presence or in the absence of acetyl- 
CoA, indicating that the binding of the first substrate does not 
create a site for a secondary substrate. These data do not 
unequivocally eliminate the possibility of a second substrate 
binding: there are cases in which a ligand may bind tightly 
without shifting the melting temperature of a protein, and 
larger T m shifts are observed for more entropically driven 
compound binding (Vedadi et al, 2010). 

Given the quite clear binding of CoA to a protein required 
for capsule biosynthesis, the need for an acetyltransferase in 
the biosynthetic pathway and the suggestive conformation of 
the bound CoA, a detailed kinetic analysis was performed. We 
used three different biochemical assays to demonstrate the 
acetyltransferase activity of Wcbl. None of these showed any 
detectable activity (Fig. 5). Assays were carried out under 
both oxidizing and reducing conditions, as previous studies on 
other acetyltransferases (Dorfmueller et al, 2012) have shown 
that the formation of an oxidized adduct between Cys and 
CoA has an inhibitory effect on catalysis, which can be re- 
activated through reduction. No activity was apparent in 
either environment. This was somewhat surprising, as the 
structure shows CoA protruding into a funnel-shaped cavity 
(Supporting Fig. S4). Furthermore, the acetyl moiety is held 
in place through hydrogen bonds to Cysl4 and His201, 
constraining its conformation and presenting the acetyl group 
in a suitable orientation for reaction. 

Moreover, detailed analysis of the structure suggests that 
there is no available binding site for a reacting molecule. 
Analysis of the structure using four different servers suggested 
that the only likely binding site for a small molecule is that 
occupied by CoA. This is consistent with our observation that 
likely binding partners, and a range of common cellular 



chemicals, do not show any affinity for Wcbl in DSF experi- 
ments. These observations, coupled with the lack of any 
distinguishable acetyltransferase activity, suggest that Wcbl 
might form part of a more extensive enzymatic complex. We 
therefore propose a model in which Wcbl functions together 
with a binding partner that presents the acceptor substrate. 
The nature of the CoA-binding pocket, as a loop-rich cradle in 
between two structured domains (Supporting Fig. S4), makes 
it more amenable to the likely subtle rearrangements that 
would be required to accommodate a presented ligand. In this 
model, we speculate that the complex between Wcbl and its 
partner would transform the novel structure of Wcbl into a 
catalytically active form upon association. Such modularity 
has been proposed as an ancestral trait providing robustness 
against mutation and chemical attack (Hartwell et al, 1999), 
and promoting evolvability (useful for CPS formation and 
evolution in B. pseudomallei). 

Our current investigations are aimed at identifying the 
identity of potential partners and understanding fully how this 
protein contributes to the virulence of B. pseudomallei. 



5. Conclusions 

The B. pseudomallei protein Wcbl, which is essential for the 
virulence of this organism, demonstrates an intriguing novel 
fold. This fold incorporates a previously observed subdomain 
into a larger folding unit, creating a loop-rich cradle -like 
structure. This cradle forms a strong binding site for coenzyme 
A, providing tight interactions for most of the cofactor. The 
two structures of Wcbl presented here, obtained under 
different conditions, suggest that Wcbl allows some flexibility 
in the CoA head group. Soaks with acetyl-CoA reveal that the 
CoA head, and likely the acetyl group, are held between 
cysteine and histidine side chains from opposite sides of the 
cradle in a conformation suggestive for reaction. However, 
extensive experiments on Wcbl have failed to demonstrate 
either physical interaction with, or acetyltransferase activity 
on, available substrate analogues. We therefore propose that 
Wcbl forms a module of a larger complex achieving the 
biosynthesis of the B. pseudomallei capsular polysaccharide. 
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